Learning Tabletop Object Manipulation by Imitation

Authors

  • Zhen Zeng
  • Benjamin Kuipers
Abstract

We aim to enable a robot to learn tabletop object manipulation by imitation. Given external observations of demonstrations of object manipulation, we believe that two underlying problems must be addressed in learning by imitation: 1) segmenting a given demonstration into skills that can be individually learned and reused, and 2) formulating the correct RL (Reinforcement Learning) problem for each skill, one that considers only the relevant aspects of that skill, so that its policy can be learned effectively. Previous work has made some progress in this direction, but none has taken private information into account. Public information is the information available in the external observations of a demonstration; private information is the information available only to the agent that executes the actions, such as tactile sensations. Our contribution is a method by which the robot automatically segments the demonstration into multiple skills, formulates the correct RL problem for each skill, and decides, based on interaction with the world, whether private information is an important aspect of each skill. Our motivating example is a real robot playing the shape sorter game by imitating others' behavior, and we will show results in a simulated 2D environment that captures the important properties of the shape sorter game. The evaluation is based on whether the demonstration is reasonably segmented and whether the correct RL problems are formulated. Finally, we will show that the robot can imitate the demonstrated behavior using the learned policies.

Similar resources

SMILE: Simulator for Maryland Imitation Learning Environment

As robot imitation learning is beginning to replace conventional hand-coded approaches in programming robot behaviors, much work is focusing on learning from the actions of demonstrators. We hypothesize that in many situations, procedural tasks can be learned more effectively by observing object behaviors while completely ignoring the demonstrator’s motions. To support studying this hypothesis ...

A Developmental Approach to Goal-Based Imitation Learning in Robots

We propose a new developmental approach to goal-based imitation learning that allows a robot to: (1) learn probabilistic models of actions through self-discovery and experience, (2) utilize these learned models for inferring the goals of human demonstrations, and (3) perform goal-based imitation for human-robot collaboration. Our approach is based on Meltzoff’s “Like-me” hypothesis in developmenta...

Learning Deep Policies for Physics-Based Manipulation in Clutter

Uncertainty in modeling real world physics makes transferring traditional open-loop motion planning techniques from simulation to the real world particularly challenging. Available closed-loop policy learning approaches, for physics-based manipulation tasks, typically either focus on single object manipulation, or rely on imitation learning, which inherently constrains task g...

A Bayesian Model of Imitation in Infants and Robots

Learning through imitation is a powerful and versatile method for acquiring new behaviors. In humans, a wide range of behaviors, from styles of social interaction to tool use, are passed from one generation to another through imitative learning. Although imitation evolved through Darwinian means, it achieves Lamarckian ends: it is a mechanism for the inheritance of acquired characteristics. Unl...

Learning object affordances by imitation Research Progress Report 3

The report explores the role of imitation in learning object affordances. Since I intend to experiment with a robotic arm enabling only pushing objects, properties of objects will be the main factor affecting the overall complexity of this “toy world”. Learning in the toy world is a process of discovering objects’ affordances, as well as the kinds of actions and goals related to them. I intend ...

Journal:
  • CoRR

Volume: abs/1603.00964

Publication date: 2016